SRMR variants for improved blind room acoustics characterization
نویسندگان
چکیده
Reverberation, especially in large rooms, severely degrades speech recognition performance and speech intelligibility. Since direct measurement of room characteristics is usually not possible, blind estimation of reverberation-related metrics such as the reverberation time (RT) and the direct-to-reverberant energy ratio (DRR) can be valuable information to speech recognition and enhancement algorithms operating in enclosed environments. The objective of this work is to evaluate the performance of five variants of blind RT and DRR estimators based on a modulation spectrum representation of reverberant speech with singleand multi-channel speech data. These models are all based on variants of the so-called Speech-to-Reverberation Modulation Energy Ratio (SRMR). We show that these measures outperform a state-of-the-art baseline based on maximum-likelihood estimation of sound decay rates in terms of root-mean square error (RMSE), as well as Pearson correlation. Compared to the baseline, the best proposed measure, called NSRMRk, achieves a 23% relative improvement in terms of RMSE and allows for relative correlation improvements ranging from 13% to 47% for RT prediction.
منابع مشابه
Session 2pSP: Acoustic Signal Processing for Various Applications 2pSP2. Towards blind reverberation time estimation for non-speech signals
Reverberation time (RT) is an important parameter for room acoustics characterization, intelligibility and quality assessment of reverberant speech, and for dereverberation. Commonly, RT is estimated from the room impulse response (RIR). In practice, however, RIRs are often unavailable or continuously changing. As such, blind estimation of RT based only on the recorded reverberant signals is of...
متن کاملImproved Finite Difference Schemes for a 3-D Viscothermal Wave Equation on a GPU
Viscothermal effects in air lead to a damping of high frequencies over time. Such effects cannot be neglected in large-scale room acoustics simulations for the full audible bandwidth. In this study, full-bandwidth room acoustics is modelled using a variant of the three-dimensional wave equation including viscothermal losses in air following from a simplification of the Navier-Stokes equations s...
متن کاملIssues for computer modelling of room acoustics in non-concert hall settings
The basic principle of common room acoustics computer models is the energy-based geometrical room acoustics theory. The energy-based calculation relies on the averaging effect provided when there are many reflections from many different directions, which is well suited for large concert halls at medium and high frequencies. In recent years computer modelling has become an established tool in ar...
متن کاملFree-field Virtual Psychoacoustic and Hearing Impairment: Paper 775 Does learning a room’s reflections aid spatial hearing?
Sound reflections are abundantly present in everyday environments; yet, our spatial hearing abilities are usually not impaired by them. One contributor to this robustness is the adaptation to the reflections after being repeatedly exposed to the room’s reverberation. The echo threshold, the delay at which a reflection starts being separately audible as an echo, increases with repeated exposure ...
متن کاملNoise and room acoustics distorted speech recognition by HMM composition
This paper presents a robust speech recognition method based on the HMM composition for the noisy room acoustics distorted speech. The method realizes an improved user interface such as the user is not encumbered by microphone equipments. The proposed HMM composition is obtained by naturally extending the HMM composition method of an additive noise to that of the convolutional room acoustics di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1510.04707 شماره
صفحات -
تاریخ انتشار 2015